Ijraset Journal For Research in Applied Science and Engineering Technology
Authors: D Kavya, D Kiran Kumar, Divya Anjali M, Ganesh A P
DOI Link: https://doi.org/10.22214/ijraset.2025.66243
Certificate: View Certificate
Access to clear health information is crucial, many people in rural and underserved areas encounter challenges such as complex medical terminology, language barriers, and a shortage of healthcare providers. These obstacles can result in confusion, delays, and poor health outcomes. A multilingual AI chatbot can simplify medical information, provide real-time assistance, and offer guidance in the user\'s preferred language. This solution can enhance access to healthcare, bridge existing gaps, and empower individuals to make informed health choices.
I. INTRODUCTION
Healthcare information accessibility is a crucial instrument of empowerment, enabling individuals to make informed health decisions. However, many people face significant challenges in understanding medical reports and symptoms, particularly in rural and underserved communities that lack access to proper healthcare facilities. Language barriers and limited access to service providers, combined with low medical literacy, further complicate this issue. Consequently, there exists a substantial gap in health accessibility, leaving many individuals unable to comprehend important health information or to take timely medical action. Despite significant advancements in artificial intelligence (AI) and natural language processing (NLP), a large portion of the population remains excluded from these benefits. Medical reports are often filled with technical jargon that can be difficult for those unfamiliar with medical terminology to understand. Additionally, the wealth of information available online regarding symptoms can be unreliable, overwhelming, and frequently misleading. The dominance of major world languages in medical reports and tools further restricts access for speakers of global dialects and local languages, making it difficult for them to engage with the information. To address these challenges, we propose an innovative solution that leverages AI-powered summarization of medical reports through a multilingual chatbot.
This system will allow users to upload health reports in PDF format, which it will then process to generate simplified, easy-to-understand summaries of diagnoses and findings. The chatbot will also enable users to ask questions related to their symptoms, receive accurate information about diseases and their causes, and interact in their preferred language, thereby promoting accessibility and inclusivity. We aim to bridge the complex health sector information gap for end-users by employing advanced NLP models, such as Bio-BERT. This initiative will enhance individuals' health decisions while facilitating a more equitable distribution of healthcare resources. Additionally, it will encourage preventive care by providing timely advice from healthcare professionals regarding their health. In the long term, this project aims to transform access to and understanding of healthcare information. Improved health literacy can lead to better patient outcomes and a more equitable healthcare system. By prioritizing inclusivity and accessibility, this initiative aspires to create a world where everyone can understand their health and take charge of their well-being effectively
II. LITERATURE REVIEW
III. PROBLEM DEFINITION
Understanding medical reports and accessing accurate health information is inherently complex, further complicated by the use of technical jargon, language barriers, and limited access to healthcare professionals. Individuals, especially those without medical expertise, often struggle to interpret these reports accurately, relying on limited resources that can vary widely in credibility. The consequences include delays in understanding diagnoses, misinformed health decisions, and increased stress and anxiety.
Existing tools for medical information, such as online platforms and apps, suffer from significant limitations. These systems often provide generic explanations based on static datasets, which fail to account for individual linguistic needs, cultural contexts, or real-time scenarios. They also lack actionable features, such as offering personalized guidance based on specific symptoms or medical reports.
This project aims to address these challenges by developing a multilingual AI-powered system that:
By integrating technology with health literacy and cultural sensitivity, this system seeks to revolutionize healthcare access, empowering individuals to make informed decisions, reduce anxiety, and bridge critical gaps in health communication.
IV. MODULE DESCRIPTION
The proposed system is built using multiple modules:
A. Report Upload Module
This module lets users upload medical reports in PDF format via a web interface, ensuring seamless input for further processing. It validates file types and sizes for security and initiates preprocessing for downstream tasks.
B. PDF Extraction Module
Extracts plain text from uploaded PDF files using PyPDF2, handling multi-page documents and ensuring clean text formatting. It also detects and removes non-text elements like images or annotations for better processing.
C. Summary Generation Module:
Generates concise summaries of extracted text using a fine-tuned Bio-BERT model for domain-specific abstractive summarization. It processes input text through tokenization and embeddings for natural language output.
D. Query Answering Module
Allows users to ask context-specific questions about their medical reports, leveraging Bio-BERT for accurate responses. It supports various query types, such as symptoms or diagnosis-related questions, ensuring high accuracy.
E. Database Management Module
Stores uploaded reports, generated summaries, and user interactions in a Firebase database. It enables retrieval of past summaries and answers, ensuring seamless access for logged-in users.
F. User Interface Module
Provides a responsive web interface for uploading reports, viewing summaries, and interacting with query responses. It maintains a session-based, chat-like interaction log for a user-friendly experience.
V. RESULTS AND EVALUATION
The System achieved over 90% accuracy in text extraction for standard PDFs, with a slight drop for low-quality scans or handwritten formats. The Bio-BERT-based summarization model retained critical medical terms, matching manually created summaries effectively. The question-answering module provided accurate, context-specific responses, while multilingual support via Google Translate ensured accessibility with preserved medical terminology across languages. These results highlight the system's robustness and efficiency.
Fig1: Sign-in
Fig2: Home
Fig3: Summary
The Medical Report Summarization System is an advanced solution designed to automate the analysis and comprehension of medical data, emphasizing accessibility and precision. Powered by Bio-BERT, a natural language processing model fine-tuned on biomedical datasets, the system extracts and summarizes crucial details such as diagnoses, symptoms, and treatments from PDF- format medical reports. Key features include multilingual translation support, enabling users to access summaries and responses in their preferred language, and text-to-speech functionality, enhancing usability for individuals with reading or visual impairments. With its scalable PDF-exclusive architecture, the system ensures efficient and reliable processing of digitally generated reports, it demonstrates exceptional performance in real-world testing by improving the speed, accuracy, and accessibility of medical report handling. By integrating summarization, translation, and accessibility tools, the system empowers healthcare professionals and patients to make well-informed decisions effectively.
[1] M. Abu et al, “MediGPT: Exploring Potentials of Conventional and Large Language Models on Medical Data”, IEEE Access, 2024. [2] T Nadarzynski, O Miles, A Cowie, and D Ridge, “Acceptability of artificial intelligence (AI)-led chatbot services in healthcare: A mixed-methods study”, Sage Journals, 2019. [3] A Goel, Satyam, and S Sharma, “Artificial Intelligence based Healthcare Chat Bot System”, 8th International Conference on Communication and Electronics Systems (ICCES), Coimbatore, India, 2023. [4] Kidwai, Bushra R K, Nadesh, “Design and Development of Diagnostic Chabot for supporting Primary Health Care Systems”, Elsevier, 2020. [5] K Jayashree, Monika K A, Preetha R., and Piraisoodan S P, “The Smart Health Care Prediction using Chatbot”, International Journal of Recent Technology and Engineering (IJRTE), 2020. [6] M.S Bennet Praba, Sagari Sen, Chailshi Chauhan, and Divya Singh “AI Healthcare Interactive Talking Agent using NLP”, International Journal of Innovative Technology and Exploring Engineering (IJITEE), 2019. [7] Akshay Mendon, Megharani Patil, Yash Gupta, Vatsal Kadakia, and Hash Doshi, “Automated Healthcare System Using AI Based Chatbot”, Springer Nature Link, 2023. [8] Shahadat Uddin, Arif Khan, Md Ekramul Hossian and Mohammad Ali Moni, “Comparing different supervised machine learning algorithms for disease prediction”, Springer Nature Link, 2019. [9] G Battineni, N Chintalapudi, and F Amenta, “AI chatbot design during an epidemic like the novel coronavirus”, Healthcare, 2020. [10] S. R. Dammavalam, N. Chandana, T. R. Rao, A. Lahari, and B. Aparna, “AI Based Chatbot for Hospital Management System”, 3rd International Conference on Computing, Analytics, and Networks (ICAN), Rajpura, Punjab, India, 2022. [11] Prakhar Srivastava, Nishant Singh, “Automatized Medical Chatbot (Medibot)”, IEEE Xplore, 2020 [12] K. Srivastava, T. N. Pandey, D. Roy, and S. Sahoo, “A Machine Learning Model on Healthcare Based Chatbot and Appointment System”, 3rd International Conference on Artificial Intelligence and Signal Processing (AISP), Vijayawada, India, 2023. [13] K. Anjum, M. Sameer, and S. Kumar, \"AI Enabled NLP based Text to Text Medical Chatbot,\" 2023 3rd International Conference on Innovative Practices in Technology and Management (ICIPTM), Uttar Pradesh, India, 2023. [14] P. S., N. Balakrishnan, K. T. R., A. J. B., and D. S., \"Design and Development of AI Powered Healthcare WhatsApp Chatbot,\" 2023 2nd International Conference on Vision Towards Emerging Trends in Communication and Networking Technologies (ViTECoN), Vellore, India, 2023. [15] N. A. I. Omoregbe, I. O. Ndaman, S. Misra, and O. O. Abayomi-Alli, \"Text messaging-based medical diagnosis using natural language processing and fuzzy logic,\" Journal of Healthcare Engineering, 2020.
Copyright © 2025 D Kavya, D Kiran Kumar, Divya Anjali M, Ganesh A P. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
Paper Id : IJRASET66243
Publish Date : 2025-01-02
ISSN : 2321-9653
Publisher Name : IJRASET
DOI Link : Click Here